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(57) Abstract: The present invention relates to a method and device for determining a synchronization fault in a synchronization 
^ network, where values of a clock control parameter associated with the synchronization control in a node are compared with a pre- 
^ dieted mask value for said clock control parameter, and a synchronization fault is determined if one or more of the measured values 
^ lies outside of the predicted mask. 
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Method and device for determining a synchronization fault in 
a network node 

[Background of the Invention] 

5 In the field of telecommunications it is known that the 

switching of signals in a digital network requires that there 
be a form of synchronization for all the nodes of the 
network. In order to accomplish the switching of digital 
signals in a network, the average rates of all the multiplex 

10 signals terminating on a switch in a . node must be 

synchronized within some achievable bound, and the reference 
clock of the switch itself must be synchronized to the common 
rate of the incoming signals. Such a network is therefore 
typically considered as consisting of a traffic network and a 

15 synchronization network. 

Synchronization networks are typically constructed by 
establishing a master-slave hierarchy of so-called stratum 
clocks, where each stratum corresponds to a different level 

2 0 of precision. For example, stratum 1 clocks are normally 
implemented with cesium beam technology of very high 
precision, whereas stratum 2, 3 and 4 clocks have 
progressively lower accuracy. Synchronization references are 
passed from higher performance master clocks to equivalent or 

25 lower performance slave clocks. It is possible to supply more 
than one reference to a given slave clock, where this slave 
clock can then decide which reference to synchronize with. 

Fig. 2 shows a schematic example of a clock system having a 
30 phase locked loop (PLL) structure. More specifically, a 
reference signal 1 is compared with the output 3 from a 
voltage controlled oscillator (VCO) 5 in a comparator 4, and 
the comparator then outputs an appropriate VCO control value 
2 (VCV) to the voltage controlled oscillator 5. 

35 

In order to cope with variations in frequency between 
different nodes, digital switches typically incorporate so- 
called slip buffers, which are able to buffer the data frames 
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if there is a difference between the write clock frequency 
and the read clock frequency. If the frequency difference 
continues for a sufficient amount of time, the slip buffer 
will eventually overflow or underflow, which results in a 
5 packet or frame of data either being repeated or deleted at 
the output of the buffer. Such slip events can have a 
different impact on the traffic network, depending on what 
kind of data is being transported. For example, when 
transporting digital data that employs automatic 
10 retransmission request (ARQ) , a slip will result in requests 
for retransmission and thus degraded throughput. 

[Problems underlying the invention] 

15 When problems occur in the traffic network, such as e.g. a 
degraded throughput, it is important to be able to identify 
the cause of the problems. The problems can be caused by 
specific failures in the traffic network or the 
synchronization network. If failures occur in the 

20 synchronization network, due for example to the creation of 
timing loops, which will lead to a high slip rate, then the 
effects on the traffic network can appear far away from the 
root in the synchronization network, i.e. where the 
synchronization failure occurred. Due to this fact, it is 

25 possible that a network operator will undertake a wrong 
action at a wrong place. 

It is therefore important to provide network operators with a 
means that can assist in finding out if the synchronization 
30 network is the real cause of problems appearing in the 
traffic network, and in which node of the network the 
synchronization is not working. 

The known solution to this object is to check the quality of 
35 the synchronization references with test instruments, and on 
a higher level, to analyze the network synchronization plan. 
One problem with test instruments is that in case of a strong 
impact of the failure (as it could e.g. occur during the 
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millennium shift), it would take a long time and large effort 
to simultaneously check the quality of all of the 
synchronization references in all of the network nodes. This 
is very time and cost consuming in case of failures in wide 
5 area networks (WAN) . Moreover, the use of test instruments is 
only feasible if the network nodes do not branch strongly and 
if the synchronization network is simple and flat. 

[Object of the present invention] 

10 

The object of the present invention is to provide a device 
and method for determining a synchronization fault that is 
applicable to any type of synchronization network, and is 
simple to implement. 

15 

[Summary of the present invention] 

This object is solved by the method of claim 1 and the device 
of claim 14 . Advantageous embodiments are described in the 
20 dependent claims. 

In accordance with the present invention, the actual values 
of a clock control parameter (such as the VCO control value 
VCV in Fig. 2) are measured and compared with the predicted 

25 behavior of said clock control parameter. The predicted 
behavior is based upon a model that takes into account 
internal factors leading to a deviation between the reference 
clock and the internal clock (such as drift and temperature 
variations) , such that a discrepancy between the predicted 

30 behavior and the measured values indicates a synchronization 
fault caused by the synchronization network. 

Although the prediction model can be entirely based on 
theoretical considerations, it is preferable that the 
35 prediction model be based on the collection of significant 
data during a predetermined observation period that precedes 
the measurement period during which the predicted behavior is 
compared with the actual behavior. This collecting of data is 
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done by observing the clock control value in the node or 
nodes e.g. under conditions when the synchronization network 
is either known to be failure- free or can be assumed to be 
failure-free. Then the characteristic values of the model are 
5 determined on the basis of the observed values, to thereby be 
able to make a prediction. Typically this prediction will 
consist in calculating a mask or value range inside of which 
the clock control parameter should lie at a given future 
point in time. Then, if the measured values lies outside of 
10 this mask or value range, it is judged that a synchronization 
fault is present. 

The method of the present invention can be implemented in 
such a way that it is only triggered when a specific failure 

15 event occurs in the traffic network, to thereby be able to 
identify if this is due to a failure in the synchronization 
network, and if yes, in which nodes the synchronization 
failure has occurred, or the present invention can also be 
implemented to operate continuously to thereby be able to 

20 give an alarm or warning as soon as a synchronization failure 
occurs in any given node of the network. 

The various aspects and advantages of the present invention 
shall become more apparent when studying the detailed 
25 embodiments, which shall be described in connection with the 
figures . 
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[Brief description of Figures] 

Fig. 1 shows a flow-chart of a basic embodiment of the 
5 present invention; 

Fig. 2 is a schematic representation of a phase locked 
loop; and 

10 Fig. 3 is a graph that explains the method of the present 
invention. 



[Detailed description of preferred embodiments] 

15 Fig. 1 shows a basic flow chart of an embodiment of the 

present invention, where a value of the VCO control value is 
measured in step SI as VCVi (VCVi represents a sample value 
of VCV(t) , where x represents the absolute time, and this 
sample value can e.g. be determined as VCVi = VCV(i -8), where 

2 0 i is an integer and 6 a sampling period) and then this 

sampled value VCVi is compared with a value range or mask 
having a mask lower limit M^l and a tnask upper limit Mjjl- The 
mask values and are calculated on the basis of a 
prediction model, where these values will generally also be 

25 time dependent, such the value VCV A is compared with mask 
values at the time i *8. If the measured value VCVi lies 
inside of the value range, then it is judged that no 
synchronization fault has occurred (step S3), whereas if the 
value of VCVi lies outside of said value range, then it is 

30 judged that a synchronization fault is present (step S4) . The 
processing after steps S3 or S4 is only shown as a number of 
dots, as this will depend on the specific application, and is 
of no further importance for the present invention. 

35 It may be noted that in the example of Fig. 1, a 

synchronization failure is determined if only one value of 
VCV lies outside of the mask, but it is equally possible that 
the comparison step S2 is modified such that a 
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synchronization fault is only judged to be present if a 
predetermined number of measured values VCV^ fall outside of 
the mask. This can be seen as a mechanism against false 
alarms due to non-recurrent freak values. 

5 

The prediction model used to calculate the mask upper limit 
Mul and mask lower limit M LL can be selected in any desirable 
or appropriate way. For example, it can be based entirely on 
theoretical considerations associated with the clock or 

10 oscillator contained in the node in which the method of the 
invention is employed. Preferably, the model however combines 
theoretical considerations with empirical data to thereby be 
able to provide a simple but reliable way of predicting the 
behavior of the clock control parameter. More specifically, 

15 as shall be explained in detail in the following, it is 
preferable to use a linear approximation of the temporal 
behavior of the clock control parameter. However, it should 
be noted that naturally approximations of higher order or 
different type are equally well applicable. 

20 

Fig. 3 shows a graph, in which the behavior of the VCO 
control value VCV in the course of time r is shown. In order 
to obtain a prediction mask, values of VCV are sampled during 
a first measurement period T measl . The length of the first 

25 measurement period and the rate of taking samples can be 

chosen as is appropriate for the specific application. For 
example, if the temperature at the node is an internal factor 
that influences the VCO control value VCV, then more than one 
sample per day should be collected in order to cover the 

3 0 expected temperature variation period. On the other hand, if 
the equipment is in a location having a controlled 
temperature (air conditioning, etc.), then a lower sample 
rate is possible. In order to cover the drifting 
characteristics of a voltage controlled oscillator, a total 

3 5 measurement period of several weeks is appropriate. 

Additionally, a sufficiently large number of samples should 
be taken, in order to provide a good basis for statistics. 

6 
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The measurement period T measl can be chosen as covering a 
period of time in which the synchronization network is free 
of failure, or in which it can be expected that the 
synchronization network is free of failure. Such a situation 
5 will e.g. occur in connection with the millennium shift, i.e. 
the transition into the year 2000. In other words, the 
measurement period T measl would be chosen to continue up 
until shortly prior to the millennium shift, i.e. until 
December 30 or December 31, 1999. Then the collection of data 
10 for modeling would be stopped and the steps of measuring 

values of VCV and comparing them to the mask range could be 
performed by way of monitoring, or as a response to a failure , 
event in the traffic network. 

15 Now a preferred model for the frequency deviation process 

will be described. The time error TE measured with respect to 
an ideal time generator (ideally locked to a UTC generator 
(UTC = Universal Time Coordinated)) after r seconds is: 

20 TE (r) = ar + 0.5 br 2 + c, 

where the parameter a describes the initial frequency offset 
and phase error due to temperature variation, parameter b 
models the frequency drift of the clock, and parameter c 
25 models phase jumps due to transition (for example when 

entering a holdover mode) . An additional term modeling the 
internal phase noise of the oscillator is considered as 
negligible in comparison with the other parameters. 

30 The fractional frequency deviation (or offset) Y with respect 
to the nominal frequency f n om is defined as: 

Y « (f - f n om) /f nom- 

35 In practice, the nominal frequency is a reference frequency 
with specified accuracy with respect to the actual nominal 
frequency. 
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The fractional frequency deviation can be evaluated by 
differentiating the time error TE: 

Y (r) = a + br. 

5 

In practice, the VCO control value (or any appropriate 
related value) that is used for controlling the PLL (see e.g. 
Fig. 2) or any other similar oscillating controlled process, 
is directly related to the above formula for Y via a constant 
10 k, as in practice it shall accommodate the natural 

oscillating process of the controlled oscillator, in order to 
follow the ideal (under normal conditions) external 
synchronization reference. 

15 The VCO control value VCV is therefore: 

VCV =kxY (r) = k (a+br). 

In order to evaluate the linear term, a first derivative 
20 A (VCV) can be used, where 5 will refer to the time interval 
between two samples VCV^ in the following: 

Aj (VCV) « VCVi +1 - VCVi = k(ai +1 + b(i + l)£) - k(ai +biJ) = 
k(ai +1 - ai) +b£. 

25 

By averaging the values Aj (VCV), the part relating to k 
( a i+l " a i) can be averaged away to zero, as it can be 
assumed that the variations due to temperature and other 
ambient causes have a zero mean value, and the parameter b 
30 can be evaluated. 

By comparing the term bS with the VCVi values, a maximum 
expected value for the parameter a can be determined. More 
specifically, on the basis of the determined value b, the 
35 values can be averaged to provide a value a, or the 

maximum value from all encountered values a^ can be selected 
as the value a for the model. Naturally, any other suitable 
evaluation of the values a^ is equally well possible. 



8 



WO 01/20828 



PCT/EP00/08599 



With the above-mentioned parameters a and b, a prediction 
value range or prediction mask for VCV can be constructed. 
This mask, which is indicated in Fig. 3 as being centered 
5 around kbr with a width of 2ka, is used as a reference 

during a second measurement period T meas2 , where the method 
shown in Fig. 1 is performed during said second measurement 
period T meas 2 . In other words, during said second measurement 
period values of VCV are measured, where it should be noted 

10 that these values VCVi shall be sampled at a higher rate than 
during the first measurement period T measl . Then, if a 
predetermined number of values VCVi (e.g. one) lie outside of 
the mask, i.e. VCVi > M UL or VCVi < M LL , then an alarm signal 
is output that indicates a synchronization fault and 

15 identifies the node in which the synchronization fault has 
been detected. As already mentioned previously, the second 
measurement period T meas 2 can be triggered by the occurrence 
of a failure in the traffic network (as indicated in Fig. 3), 
or can be triggered regularly or periodically as a means for 

20 monitoring the behavior of the clock control parameter in 
order to immediately be able to identify a network 
synchronization failure when it occurs. 

In the case of Fig. 3, the width of the mask is determined by 
25 a parameter a that results from the measurements during the 
first measurment period T^^. It is however equally well 
possible to only deduce the parameter b from the 
measurements, in order to provide the time dependent factor 
kbx, and to select a constant value for the width of the 
30 mask. 

As already mentioned earlier, it is possible to select the 
first measurement period T meas i and the second measurement 
period T Tneas 2 in view of a specific event that can be 
35 expected to cause problems in the network, such as the 

millennium transition from December 31, 1999 to January 1, 
2000. For example, the first measurement period could be 
chosen as one month from December 1, 1999 to December 31, 

9 
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1999 at 11 p.m. , and the second measurement period could be 
chosen as from 11.30 p.m. on December 31, 1999 to 12 noon on 
January 1, 2000. 

5 However, it is also possible to implement the method and 
device of the present invention such that a continuous 
measuring with respect to the first measurement period is 
conducted, to thereby achieve a continuous updating of the 
model parameters, e.g. the parameters a, b described above. 

10 For this purpose, the method of the invention is implemented 
in such a way that the length of the measurement period 
T measl ^ s fi xed to a given value, e.g. one week, and a given 
mask or value range is associated with each measurement 
period T meas i^. The mask or value range is identified by the 

15 relevant parameters, such as a, b derived from the 

corresponding measurement period. Then, after the expiration 
of the given measurement period T^agi*, a new measurement 
period T measl i +1 is started for the purpose of updating the 
parameters a, b. During this subsequent period T measl i+1 , the 

20 determination of a synchronization fault in accordance with 
the method described in connection with Fig. 1 is conducted 
on the basis of the mask associated with measurement period 
Tmeasl"*' • Then, when the period T measl i + 1 has expired, the 
values of a, b are updated in accordance with the results of 

25 said measurement period, and a further measurement period 

T meas ^i + 2 i s started. This process is then continued as long 
as the continuous monitoring and updating is desired. 

In other words, the method is implemented such that each 
30 first measurement period T^ag^* that is used to determine a 
mask value (e.g. in terms of parameter a, b) is associated 
with a second measurement period T meas2 ^ during which the 
measured values of VCV are compared with the mask associated 
with T measl i , where T meas2 i begins after T measl i has 
35 expired. However, as a new first measurement period T measl i+1 
begins after Tmeag! 1 , the second measurement period T meas2 i 
will typically fall into this subsequent first measurement 
period T meas ii +1 . As already mentioned, the mask to be used 
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during the period T meaS 2 i will be the one determined on the 
basis of the measurement during T^asi*. However, as a 
measure of security, if the time difference between the 
occurence of the condition that triggers the begin of T meaS 2 1 
5 (such as a predetermined fault condition in the traffic 

network, or a periodically recurring trigger signal) and the 
end of Tmeasi 1 (i.e. the most recent first time period for 
which model parameters for mask determination were stored) is 
smaller than a predetermined limit, then the step of 
10 comparing values VCV with the mask will be done on the basis 
of the mask associated with the measurement period T measl i " 1 , 
i.e. the first measurement period immediately preceding 

T measl a • 

15 This security measure is especially important if the begin of 
the second measurement period T meaS 2 i cannot be determined in 
advance, e.g. if it is triggered by a failure event in the 
traffic network. If the distance is too short, there is a 
danger that the mask is already influenced by the failure. 

20 Naturally, if the second measurement periods T me as2 1 are 
started automatically and regularly, then the time span 
between the end of T^agx 1 and the begin of T meas2 1 can be 
adjusted appropriately. 

25 In the following, a detailed example of the application of 

the present invention to a system clock of a digital exchange 
when preparing for the millennium shift will be given. 

The control value of the system clock of the digital exchange 
30 has the default value of 2048, and it changes in units of 300 

for every ppm of frequency adjustment. If the system clock is 

an ANSI stratum 3E clock, then it typically drifts no more 

than 10~ 3 ppm per day, i.e. 3 control units every 10 days. 

The maximum variation due to temperature variations should be 
35 less than 10~ 2 ppm, but in a temperature controlled 

environment we should expect a much better behavior, e.g. 0,3 

x 10" 2 ppm corresponding to 1 control unit. 

11 
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The first measurement period T meas i should start on December 
1, 1999. After 30 days measurement, the value could be e.g. 9 
+/- 1 control unit over/under the starting point. Whatever it 
will be, a mask can be defined based on the collected data as 
5 described previously. If something happens (e.g. a timing 

loop) in the synchronization network, and the synchronization 
reference changes suddenly, such that the frequency changes 
by a quantity of e.g. 10"" 1 ppm, slips will occur with the 
periodicity of about 1 every 2 0 minutes. After a few hours, 
10 e.g. 10 hours, this could create an unacceptable performance 
degradation in the network, e.g. if the traffic network is a 
signaling network, it could stop working. 

The automatic check procedure triggered by this degradation 
15 will then start in all of the network nodes, finding the 
system clock control value at a value that is well outside 
the expected (predicted) mask only in those nodes where the 
synchronization is actually degraded. On the other hand, if 
the cause of errors in the network (e.g. congestion) is not 

2 0 due to synchronization problems, no mask will be exceeded in 

the network, and it will thereby be clear that the problems 
in the traffic network are not due to problems in the 
synchronization network, and consequently no false action 
will be taken, at least not with respect to the 
25 synchronization network. 

As already mentioned, the check procedure during the second 
measurement period T meaS 2 could alternatively be started 
before any failure is found, in order to prevent the failure 

3 0 from having a large effect, as it could be immediately 

detected. 

The above -described method and device has a number of 
advantages. For example, a fault or failure in a network 
3 5 element due to bad synchronization can be found quickly and 
with little effort. Also, the above-described method is 
particularly suited for determining errors in a 
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synchronization network when problems can be expected at a 
certain point in the future (e.g. the millennium shift) . 

Although the present invention has been described on the 
basis of specific embodiments, these embodiments are only- 
presented to more clearly describe the character of the 
invention to the person skilled in the art, and are not to be 
seen as limiting the present invention, which is defined in 
its scope by the appended claims. 
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[Claims] 

1. A method for determining a synchronization fault in at 
5 least one node of a network, said network comprising a 

traffic network and an associated synchronization 
network, where said synchronization network supplies 
network synchronization information to said node, said 
method comprising: 

10 - measuring (SI) values of a clock control parameter 

(VCV) associated with synchronization control in said 
node, said clock control parameter (VCV) depending on 
internal factors associated with said node and on said 
network synchronization information, 

15 - comparing (S2) said measured values with a value range 

calculated on the basis of a prediction model of said 
clock control parameter (VCV) , in order to determine if 
said measured values lie in said value range, where said 
prediction model takes into account said internal 

20 factors, and 

- determining (S4) a synchronization fault in said node 
if a predetermined number of measured values lies 
outside of said value range. 

25 2. The method according to claim 1, characterized in that 
said method is implemented to continuously perform the 
procedure of determining a synchronization fault in one 
or more of the nodes of said network, where an alarm 
signal identifying a node with a synchronization fault 

3 0 is output if a synchronization fault is determined. 

3. The method according to claim 1, characterized in that 
said method is implemented to perform the procedure of 
determining a synchronization fault in all of the nodes 
35 of the network if a predetermined failure event occurs 

in the traffic network, where an alarm signal 
identifying a node with a synchronization fault is 
output if a synchronization fault is determined. 
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4. The method according to one of claims 1 to 3, 

characterized in that said clock control parameter (VCV) 
is a control parameter for a local oscillator (VCO) . 

5 

5- The method according to claim 4, characterized in that 
said oscillator is a voltage controlled oscillator in a 
phase locked loop, and said clock control parameter 
(VCV) is the control voltage of said oscillator. 

10 

6. The method according to claim 4 or 5, characterized in 
that one of said internal factors is the drift of said 
oscillator. 



15 7. The method according to one of claims 1 to 6, 

characterized in that one of said internal factors is 
the temperature . 

8. The method according to claim 1, characterized in that 
20 said prediction model is based on observing said clock 

control parameter (VCV) over a predetermined period of 
time (T^gi) under fault-free operation conditions of 
said synchronization network. 

25 9. The method according to claim 8, characterized in that 
said method is implemented such that said predetermined 
period of time (T^^) lies prior to a point in time at 
which a synchronization fault is expected, and said 
determining of a synchronization fault is performed 

30 during a second period of time (T^^) in which said 

point in time lies. 

10. The method according to claim 1, characterized in that 

said prediction model is based on continuously observing 
35 said clock control parameter (VCV) , where after each 

lapse of a predetermined time interval (T^^ 1 ) , new 
model parameters (a,b) for calculating a value range to 
be used in said comparison step is calculated and 
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stored, said new model parameters (a,b) being associated 
with the most recently lapsed time interval. 

11. The method according to claim 10, characterized in that 
5 said step of comparing measured values with a value 

range is triggered by the occurrence of a predetermined 
condition in said network, where the value range used in 
said comparison step is calculated from the most 
recently stored model parameters (a,b), except if the 

10 time difference between the occurrence of said 

predetermined condition and the end of the time interval 
with which said most recently stored model parameters 
(a,b) is associated is smaller than a predetermined 
limit, in which case the stored model parameters 

15 immediately preceding said most recently stored model 

parameters are used for determining the value range used 
in said comparison step. 

12. The method according to claim 11, characterized in that 
20 said predetermined condition is a prespecified failure 

event in said traffic network. 

13. The method according to claim 11, characterized in that 
said predetermined condition is a regularly recurring 

25 trigger signal, such that said comparison step is 

performed regularly. 

14. A fault determination device for a node of a network, 
said network comprising a traffic network and an 

30 associated synchronization network, comprising: 

- a synchronization control device that is arranged to 
receive network synchronization information from said 
synchronization network and to synchronize actions in 
said node with said network synchronization information, 

35 comprising a clock device that is controlled by a clock 

control parameter (VCV) , said clock control parameter 
(VCV) depending on internal factors associated with said 
node and on said network synchronization information, 
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- a measuring device for measuring values of said clock 
control parameter (VCV) , 

- a calculating device for calculating a value range on 
the basis of a prediction model of said clock control 
parameter (VCV) , 

- a comparing device for comparing said measured values 
with said value range, in order to determine if said 
measured values lie in said value range, and 

- a fault determining device for determining a 
synchronization fault in said node if a predetermined 
number of measured values lies outside of said value 
range . 
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